On efficient estimators of the proportion of true null hypotheses in a multiple testing setup
نویسندگان
چکیده
We consider the problem of estimating the proportion θ of true null hypotheses in a multiple testing context. The setup is classically modeled through a semiparametric mixture with two components: a uniform distribution on interval [0, 1] with prior probability θ and a nonparametric density f . We discuss asymptotic efficiency results and establish that two different cases occur whether f vanishes on a set with non null Lebesgue measure or not. In the first case, we exhibit estimators converging at parametric rate, compute the optimal asymptotic variance and conjecture that no estimator is asymptotically efficient (i.e. attains the optimal asymptotic variance). In the second case, we prove that the quadratic risk of any estimator does not converge at parametric rate. We illustrate those results on simulated data.
منابع مشابه
Generalized estimators for multiple testing : proportion of true nulls and false discovery rate
Two new estimators are proposed: one for the proportion of true null hypotheses and the other for the false discovery rate (FDR) of one-step multiple testing procedures (MTPs). They outperform existing such estimators when applied to discrete p-values whose null distributions dominate the uniform distribution and reduce to leading such estimators when applied to continuous p-values. For the new...
متن کاملPost hoc power estimation in large-scale multiple testing problems
BACKGROUND The statistical power or multiple Type II error rate in large-scale multiple testing problems as, for example, in gene expression microarray experiments, depends on typically unknown parameters and is therefore difficult to assess a priori. However, it has been suggested to estimate the multiple Type II error rate post hoc, based on the observed data. METHODS We consider a class of...
متن کاملEstimating the proportion of true null hypotheses, with application to DNA microarray data
We consider the problem of estimating the proportion of true null hypotheses, π0, in a multiple-hypothesis set-up. The tests are based on observed p-values. We first review published estimators based on the estimator that was suggested by Schweder and Spjøtvoll. Then we derive new estimators based on nonparametric maximum likelihood estimation of thep-value density, restricting to decreasing an...
متن کاملAn adaptive significance threshold criterion for massive multiple hypotheses testing
This research deals with massive multiple hypothesis testing. First regarding multiple tests as an estimation problem under a proper population model, an error measurement called Erroneous Rejection Ratio (ERR) is introduced and related to the False Discovery Rate (FDR). ERR is an error measurement similar in spirit to FDR, and it greatly simplifies the analytical study of error properties of m...
متن کاملEstimating the proportion of true null hypotheses when the statistics are discrete
MOTIVATION In high-dimensional testing problems π0, the proportion of null hypotheses that are true is an important parameter. For discrete test statistics, the P values come from a discrete distribution with finite support and the null distribution may depend on an ancillary statistic such as a table margin that varies among the test statistics. Methods for estimating π0 developed for continuo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013